Task-Driven Progressive Part Localization for Fine-Grained Object Recognition
نویسندگان
چکیده
منابع مشابه
Fine-Grained and Layered Object Recognition
This paper presents a novel research on promoting the performance and enriching the functionalities of object recognition. Instead of simply ̄tting various data to a few prede ̄ned semantic object categories, we propose to generate proper results for di®erent object instances based on their actual visual appearances. The results can be ̄ne-grained and layered categorization along with absolute or ...
متن کاملPart Localization using Multi-Proposal Consensus for Fine-Grained Categorization
We present a simple deep learning framework to simultaneously predict keypoint locations and their respective visibilities and use those to achieve state-of-the-art performance for fine-grained classification. We show that by conditioning the predictions on object proposals with sufficient image support, our method can do well without complicated spatial reasoning. Instead, inference methods wi...
متن کاملUnsupervised Template Learning for Fine-Grained Object Recognition
Fine-grained recognition refers to a subordinate level of recognition, such as recognizing different species of animals and plants. It differs from recognition of basic categories, such as humans, tables, and computers, in that there are global similarities in shape and structure shared cross different categories, and the differences are in the details of object parts. We suggest that the key t...
متن کاملTaxonomic Curriculum Learning for Fine-Grained Object Recognition
Drawing inspiration from taxonomic methods of learning and curriculum learning, we propose a new learning method: taxonomic curriculum learning. With this method, given a visual taxonomy of finegrained classes, a classifier is trained in a sequential coarse to fine manner. We compare our method with two baseline methods: the standard approach of purely fine-grained learning and multi-task learn...
متن کاملFine-Grained Recognition without Part Annotations: Supplementary Material
In the main text we showed that large gains from using a VGGNet [5] architecture on the CUB-2011 [6] dataset. We show a similar comparison on the cars-196 [3] dataset in Tab. 1. As before, using a VGGNet architecture leads to large gains. Particularly striking is the gain from fine-tuning a VGGNet on cars-196 – a basic R-CNN goes from 57.4% to 88.4% accuracy only by fine-tuning, much larger tha...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Multimedia
سال: 2016
ISSN: 1520-9210,1941-0077
DOI: 10.1109/tmm.2016.2602060